ROCsearch in a Wider Context — A ROC-Guided Search Strategy for Subgroup Discovery and Beyond

نویسندگان

  • Wouter Duivesteijn
  • Marvin Meeng
  • Arno Knobbe
چکیده

ROCsearch is a ROC-based beam search variant, initially developed for Subgroup Discovery (SD). In ordinary beam search, on each search level, a fixed number of best-scoring candidates are selected to generate candidates for the next search level. This fixed number, the beam width, is typically hard to set, and its setting strongly influences the outcome of the mining process. In ROCsearch, however, on each search level, the beam width is set dynamically by analyzing the intermediate results in ROC space. Thus, setting the beam width parameter is taken out of the domain expert’s hands, lowering the threshold for using Subgroup Discovery. Also, ROCsearch automatically adapts its search behavior to the properties and resulting search landscape of the dataset at hand. In Subgroup Discovery, ROCsearch has been shown to be an order of magnitude more efficient than traditional beam search, while its results are equivalent and on large datasets even better than traditional beam search results. However, ROCsearch has not been investigated beyond SD, while it should be readily applicable in machine learning and data mining outside of the SD subfield. As work in progress, we propose this wider outlook for ROCsearch.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ROCsearch - An ROC-Guided Search Strategy for Subgroup Discovery

Subgroup Discovery (SD) aims to find coherent, easy-to-interpret subsets of the dataset at hand, where something exceptional is going on. Since the resulting subgroups are defined in terms of conditions on attributes of the dataset, this data mining task is ideally suited to be used by non-expert analysts. The typical SD approach uses a heuristic beam search, involving parameters that strongly ...

متن کامل

Development of an Efficient Hybrid Method for Motif Discovery in DNA Sequences

This work presents a hybrid method for motif discovery in DNA sequences. The proposed method called SPSO-Lk, borrows the concept of Chebyshev polynomials and uses the stochastic local search to improve the performance of the basic PSO algorithm as a motif finder. The Chebyshev polynomial concept encourages us to use a linear combination of previously discovered velocities beyond that proposed b...

متن کامل

A GUIDED TABU SEARCH FOR PROFILE OPTIMIZATION OF FINITE ELEMENT MODELS

In this paper a Guided Tabu Search (GTS) is utilized for optimal nodal ordering of finite element models (FEMs) leading to small profile for the stiffness matrices of the models. The search strategy is accelerated and a graph-theoretical approach is used as guidance. The method is evaluated by minimization of graph matrices pattern equivalent to stiffness matrices of finite element models. Comp...

متن کامل

Going beyond the Hero in Leadership Development: The Place of Healthcare Context, Complexity and Relationships; Comment on “Leadership and Leadership Development in Healthcare Settings – A Simplistic Solution to Complex Problems?”

There remains a conviction that the torrent of publications and the financial outlay on leadership development will create managers with the skills and characters of perfect leaders, capable of guiding healthcare organisations through the challenges and crises of the 21st century. The focus of much attention continues to be the search for the (illusory) core set of heroic qualities, abilities o...

متن کامل

Expert-Guided Subgroup Discovery: Methodology and Application

This paper presents an approach to expert-guided subgroup discovery. The main step of the subgroup discovery process, the induction of subgroup descriptions, is performed by a heuristic beam search algorithm, using a novel parametrized definition of rule quality which is analyzed in detail. The other important steps of the proposed subgroup discovery process are the detection of statistically s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014